SecAudit 🛡️

AI-powered security scanner that finds vulnerabilities static tools can't — business logic flaws, auth bypasses, cross-file attack chains, and more.

Why SecAudit?

Traditional static analysis tools (Semgrep, Bearer) match patterns. SecAudit understands your code using LLMs to find vulnerabilities that require semantic reasoning:

Vulnerability Type	Semgrep CE	SecAudit
SQL Injection (basic)	✅	✅
IDOR (missing ownership checks)	❌	✅
Auth bypass (conditional logic flaws)	❌	✅
Cross-module attack chains	❌	✅
Race conditions (TOCTOU)	❌	✅
Business logic flaws	❌	✅
JWT algorithm confusion	❌	✅
CAPTCHA/verification bypass	❌	✅

Verified Results

We ran both tools on the same codebases and verified findings with real HTTP exploits against live applications:

Test	Semgrep CE	SecAudit	Verified
jsonwebtoken (CVE-2022-23529)	0 findings	11 findings	✅ Forged unsigned JWT accepted
vulnerable-express (SQLi)	0 SQL findings	SQLi + IDOR + hardcoded creds	✅ UNION injection confirmed
Juice Shop (62 route files)	15 findings	114 findings	✅ 5 exploits demonstrated

Live exploit examples (Juice Shop):

# SQL Injection → leaked 20+ user password hashes
curl "/rest/products/search?q=test'))UNION+SELECT+id,email,password,'','','','','',''FROM+Users--"

# Auth bypass → changed admin password without knowing old one
curl -X PUT "/rest/user/change-password?new=hacked&repeat=hacked" -H "Authorization: Bearer $TOKEN"

# IDOR → accessed other users' baskets
curl "/rest/basket/1" -H "Authorization: Bearer $JIM_TOKEN"  # Jim sees admin's basket

# CAPTCHA bypass → answer leaked in API response
curl "/rest/captcha"  # Returns {"answer":"60"} 🤦

How It Works

SecAudit has four analysis modes, from fast to deep:

┌─────────────────────────────────────────────────────┐
│  Static Rules (172 rules, instant, free)            │
│  └─ Regex patterns for common vulns                 │
├─────────────────────────────────────────────────────┤
│  LLM Analysis (per-file, ~$0.001/file)              │
│  └─ AI reads each file, finds semantic issues       │
├─────────────────────────────────────────────────────┤
│  Deep RLM (recursive cross-file analysis)           │
│  └─ 4-phase: recon → focused → cross-module → dedup│
├─────────────────────────────────────────────────────┤
│  REPL Mode (LLM writes & executes analysis code)    │
│  └─ AI runs Python/ripgrep/tree-sitter in Docker    │
└─────────────────────────────────────────────────────┘

Quick Start

npm install -g secaudit

# Login with ChatGPT (free, uses your subscription)
secaudit login

# Basic scan (static + LLM)
secaudit scan ./my-project

# Deep cross-file analysis
secaudit scan ./my-project --deep

# Full power: LLM executes code in Docker sandbox
secaudit scan ./my-project --deep --repl

# Static only (instant, no API calls)
secaudit scan ./my-project --no-llm

# Dependency vulnerabilities (SCA via OSV)
secaudit deps ./my-project

Installation

# Requires Node.js 18+
npm install -g secaudit

# For REPL mode, also need Docker (Colima works)
brew install colima docker
colima start

CLI Reference

secaudit scan <path>          Scan for vulnerabilities
  --no-llm                    Static rules only (no API calls)
  --deep                      Enable cross-file RLM analysis
  --repl                      Enable Docker REPL sandbox (requires Docker)
  --max-depth <n>             RLM recursion depth (default: 2)
  --max-iterations <n>        Max LLM calls (default: 30)
  --severity <level>          Minimum severity: critical|high|medium|low
  --format <fmt>              Output: terminal|json|sarif
  --baseline                  Compare against baseline
  -q, --quiet                 Suppress output
  -v, --verbose               Verbose output

secaudit deps <path>          SCA dependency scan (OSV API)
secaudit rules                List all static rules
secaudit baseline <path>      Generate baseline file
secaudit login                Authenticate with ChatGPT
secaudit verify <path>        Verify C/C++ findings with Docker + ASan

Analysis Modes

Static Rules (172 rules)

Instant regex-based scanning across 8+ languages:

JavaScript/TypeScript — SQLi, XSS, eval, prototype pollution, SSRF, open redirect
Python — Command injection, deserialization, path traversal
Go — SQL injection, weak crypto, race conditions
Java — XXE, LDAP injection, insecure deserialization
PHP — SQLi, file inclusion, command execution
Rust — Unsafe blocks, unwrap on user input
C/C++ — Buffer overflow, format string, use-after-free, TOCTOU

Every rule includes CWE ID and OWASP Top 10 2021 mapping.

LLM Analysis

AI reviews each file for issues regex can't catch:

Business logic flaws
Missing input validation
Insecure API design
Authentication/authorization gaps

Default model: gpt-5.3-codex via ChatGPT subscription (free).

Deep RLM (Recursive Language Model)

Inspired by RLM, the deep mode performs multi-phase recursive analysis:

Reconnaissance — LLM maps project structure, identifies security-critical modules
Focused Analysis — Batched deep-dive into each critical module with cross-file context
Cross-Module — Finds attack chains spanning multiple files (e.g., SQLi → auth bypass → account takeover)
Aggregation — LLM-assisted deduplication and severity ranking

REPL Mode

The LLM gets a Docker sandbox with analysis tools and writes code to audit your codebase:

# The LLM actually runs code like this in Docker:
import subprocess
result = subprocess.run(["rg", "-n", "req.body.UserId", "/code"], capture_output=True, text=True)
# Finds that UserId comes from user input → traces to DB queries → confirms IDOR

Available in sandbox: Python 3.12, ripgrep, tree-sitter, networkx, jedi.

Configuration

Create .secaudit.yml in your project root:

severity: medium          # Minimum severity to report
format: terminal          # Output format

rules:
  disable:                # Rules to skip
    - AUTH_HARDCODED_PASSWORD
  enable: []              # Only run these rules

ignore:                   # Paths to exclude
  - "test/**"
  - "vendor/**"

llm:
  provider: openai-codex  # LLM provider
  model: gpt-5.3-codex    # Model
  concurrency: 5          # Parallel file analysis

Inline Suppression

// secaudit-disable-next-line SQL_TEMPLATE_LITERAL
const query = `SELECT * FROM users WHERE id = ${id}`;

// secaudit-disable-file
// (disables all rules for this file)

Output Formats

Terminal (default)

  login.ts
     CRIT   User-controlled input directly concatenated into SQL query
           Line 34 · SQL Injection · SQL_TEMPLATE_LITERAL · CWE-89 · A03:2021

JSON

secaudit scan . --format json

SARIF (for GitHub Code Scanning)

secaudit scan . --format sarif > results.sarif

GitHub Action

- uses: BoxiYu/SecAudit@main
  with:
    path: ./src
    severity: high
    format: sarif

SCA (Software Composition Analysis)

Scans dependencies for known vulnerabilities via the OSV API:

secaudit deps .

Supports: package-lock.json, yarn.lock, pnpm-lock.yaml, requirements.txt, Pipfile.lock, go.sum, Cargo.lock, Gemfile.lock, composer.lock, pom.xml.

How It Compares

Feature	SecAudit	Semgrep CE	Bearer	CodeQL
Static rules	172	20,000+	1,000+	2,000+
Business logic detection	✅	❌	❌	❌
Cross-file analysis	✅	❌	Partial	✅
IDOR / auth gap detection	✅	❌	❌	❌
Race condition detection	✅	❌	❌	❌
Attack chain discovery	✅	❌	❌	❌
LLM-powered analysis	✅	Paid only	❌	❌
Code execution sandbox	✅	❌	❌	❌
SCA	✅ (OSV)	Paid	❌	❌
Speed	Minutes	Seconds	Seconds	Minutes
Cost	Free*	Free	Free	Free
Languages	8+	30+	12+	10+

* Uses your ChatGPT subscription for LLM features. Static rules are always free.

Architecture

src/
├── cli.ts                 # CLI entry point (commander)
├── types.ts               # Finding, Severity, ScanResult types
├── config.ts              # .secaudit.yml config loader
├── providers/
│   └── pi-ai.ts           # LLM provider abstraction (@mariozechner/pi-ai)
├── scanner/
│   ├── static.ts          # Static rule engine
│   ├── llm.ts             # Per-file LLM analysis
│   ├── deep-llm.ts        # 4-phase RLM deep analysis
│   ├── rlm-engine.ts      # Recursive LLM engine core
│   ├── rlm-repl.ts        # Docker REPL container management
│   ├── rlm-docker.ts      # Docker image builder
│   ├── sca.ts             # OSV dependency scanner
│   ├── git-history.ts     # Git commit analysis
│   ├── sandbox.ts         # C/C++ verification sandbox
│   └── rules/             # 172 static rules (23 categories)
├── reporters/
│   ├── terminal.ts        # Terminal output with colors
│   ├── json.ts            # JSON reporter
│   └── sarif.ts           # SARIF reporter
└── index.ts               # Public API exports

License

MIT — see LICENSE.

Contributing

Issues and PRs welcome. This project is in active development.

Credits

LLM provider abstraction: @mariozechner/pi-ai
Recursive analysis inspired by: RLM
SCA data: OSV.dev
Benchmark target: OWASP Juice Shop

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
action		action
benchmark		benchmark
skill		skill
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
DESIGN.md		DESIGN.md
EVMBENCH_STRATEGY.md		EVMBENCH_STRATEGY.md
LICENSE		LICENSE
README.md		README.md
UPGRADE_PLAN.md		UPGRADE_PLAN.md
evmbench-judge.mjs		evmbench-judge.mjs
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SecAudit 🛡️

Why SecAudit?

Verified Results

Live exploit examples (Juice Shop):

How It Works

Quick Start

Installation

CLI Reference

Analysis Modes

Static Rules (172 rules)

LLM Analysis

Deep RLM (Recursive Language Model)

REPL Mode

Configuration

Inline Suppression

Output Formats

Terminal (default)

JSON

SARIF (for GitHub Code Scanning)

GitHub Action

SCA (Software Composition Analysis)

How It Compares

Architecture

License

Contributing

Credits

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SecAudit 🛡️

Why SecAudit?

Verified Results

Live exploit examples (Juice Shop):

How It Works

Quick Start

Installation

CLI Reference

Analysis Modes

Static Rules (172 rules)

LLM Analysis

Deep RLM (Recursive Language Model)

REPL Mode

Configuration

Inline Suppression

Output Formats

Terminal (default)

JSON

SARIF (for GitHub Code Scanning)

GitHub Action

SCA (Software Composition Analysis)

How It Compares

Architecture

License

Contributing

Credits

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages